3574 results found.
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC BY-SA
Size:
15 GByte Production Status:
Newly created-finished
Use:
Emotion Recognition/Generation
-
Paper title:MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations
-
Paper track:Long/Sentiment Analysis and Argument Mining
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Soujanya Poria | MELD | /N |
Documentation:
https://github.com/SenticNet/MELD
Written
Named Entity Recognizer,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
GNU General Public License v3
Size:
1 GByte Production Status:
Existing-used
Use:
Named Entity Recognition
-
Paper title:Enhancing Pre-Trained Language Representations with Rich Knowledge for Machine Reading Comprehension
-
Paper track:Long/Question Answering
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Quan Wang | Stanford Core NLP (NER) | /N |
Documentation:
https://stanfordnlp.github.io/CoreNLP/tutorials.html
Written
Corpus,
Language Type:
Bilingual
Languages:
Chinese English
Availability:
Freely Available
License:
LICENSEE, by Princeton University
Size:
10 MByte Production Status:
Existing-used
Use:
Knowledge Discovery/Representation
-
Paper title:Enhancing Pre-Trained Language Representations with Rich Knowledge for Machine Reading Comprehension
-
Paper track:Long/Question Answering
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Quan Wang | Wordnet | /N |
Documentation:
https://wordnet.princeton.edu/documentation
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
154 MByte Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:Using Human Attention to Extract Keyphrase from Microblog Post
-
Paper track:Short/Information Extraction and Text Mining
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yingyi Zhang | GECO | /N |
Documentation:
None
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Included language is licensed under CreativeCommons.
Size:
14 GByte Production Status:
Newly created-finished
Use:
Language and Vision
-
Paper title:A Corpus for Reasoning about Natural Language Grounded in Photographs
-
Paper track:Long/Vision, Robotics, Multimodal, Grounding and Speec
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Alane Suhr | Natural Language for Visual Reasoning for Real | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
500 GByte Production Status:
Existing-updated
Use:
Person Identification
-
Paper title:I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences
-
Paper track:4.7 Evaluation of speaker and language identificat/Poster Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kong Aik Lee | NIST SRE Evaluations | /N |
Documentation:
Yes
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None MByte Production Status:
Existing-used
Use:
semantic parsing
-
Paper title:Discourse Representation Parsing for Sentences and Documents
-
Paper track:Long/Document Analysis
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jiangming Liu | Groningen Meaning Bank | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
N/A
Size:
1.3 Million Documents OtherProduction Status:
Newly created-finished
Use:
Summarisation
-
Paper title:BIGPATENT: A Large-Scale Dataset for Abstractive and Coherent Summarization
-
Paper track:Short/Summarization
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Eva Sharma | BIGPATENT dataset | /N |
Documentation:
Yes, it is in English and available publicly at resource url.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Owner
License:
Size:
118 item descriptions OtherProduction Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:The Language of Legal and Illegal Activity on the Darknet
-
Paper track:Long/Applications
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Daniel Hershcovich | eBay-corpus | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
English German Spanish
Availability:
From Owner
License:
Size:
10000 Onion Addresses OtherProduction Status:
Newly created-finished
Use:
Document Classification, Text categorisation
-
Paper title:The Language of Legal and Illegal Activity on the Darknet
-
Paper track:Long/Applications
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Daniel Hershcovich | DUTA-10K | /N |
Documentation:
None




